中图分类
执行
    中文(共12篇) 外文(共15458篇)
    排序:
    导出 保存至文件
    [机翻] 2017年学生集群竞赛,NTHU团队:在Intel Skylake和Nvidia P100架构上复制tersoff多体势矢量化
    摘要 : Markus Hohnerbach et al. recently published a work to optimize the performance of Tersoff potential, which is a computing scheme used in the LAMMPS molecular dynamics (MD) code. The optimization solver was implemented with three d... 展开

    [机翻] 2018年学生集群竞赛,清华大学团队:英特尔天湖架构上2004年苏门答腊海啸大推力地震的多物理模拟再现性能
    摘要 : As a special activity of the Student Cluster Competition at SC18 conference, we made an attempt to reproduce the performance evaluations of an optimized version of earthquake simulation software SeisSol. Our experiments were condu... 展开

    摘要 : With growing applications such as image recognition, speech recognition, ADAS, and AIoT, artificial intelligence (AI) frameworks are becoming popular in various industries. Currently, many choices for neural network frameworks exi... 展开
    关键词 : AI model compilers   NNEF   NNAPI  

    摘要 : Over the past decade, deep convolutional neural networks (CNN) have been widely embraced in various visual recognition applications owing to their extraordinary accuracy. However, their high computational complexity and excessive ... 展开

    摘要 : Embedded multicore systems are playing increasingly important roles in the design of consumer electronics. The objective of such systems is to optimize both performance and power characteristics of mobile devices. However, current... 展开

    [期刊]   Chen, Tai-Liang   Chen, Yi-Ru   Yu, Meng-Shiun   Lee, Jenq-Kuen   《Journal of supercomputing》    2021年77卷8期      共31页
    摘要 : Deep learning compiler tool, Tensor Virtual Machine (TVM), has excellent deployment, compilation, and optimization capabilities supported by the industry following the vigorous growth in neural networks (NN). It has a unified inte... 展开

    [机翻] 嵌入式多核系统设计模式的低功耗编译器
    摘要 : Minimization of power dissipation can be considered at algorithmic, compiler, architectural, logic, and circuit level. Recent research trends for multicore programming models have come to the direction that parallel design pattern... 展开

    [机翻] 在LLVM上为PTX模拟器支持opencl2.0编译器
    摘要 : Heterogeneous systems that consist of multiple CPUs and GPUs for high-performance computing are becoming increasingly popular, and OpenCL (Open Computing Language) provides a framework for writing programs that can be executed acr... 展开
    关键词 : OpenCL   Gem5-gpu   LLVM   Libclc   PTX  

    摘要 : Currently, GPGPU-Sim has become an important vehicle for academic architecture research. It is a cycle-accurate simulator that models the contemporary graphics processing unit. Machine learning has now been widely used in various ... 展开
    关键词 : Low-power numerical   GPGPU   Simulator  

    [期刊]   Yang, Shun-Ren   Yuan, Shih-Chun   Lin, Yi-Chun   Yang, I-Fen   《Mobile networks & applications》    2022年27卷1期      共12页
    摘要 : With the progress of medical science and technology and the healthy changes in eating habits, the proportion of aged population is gradually increasing. Smart-home elderly care has thus attracted a lot of research attention in the... 展开

    研究趋势
    相关热图
    学科分类